-- The MAILING DATE of this communication appears on the cover sheet with the correspondence address --
Period for Reply 



A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1.136(a). In no event, however, may a reply be timely filed
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133).
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1) [X] Responsive to communication(s) filed on August 20, 2004.
2a) [X] This action is FINAL. 2b) [ ] This action is non-final.

3) [ ] Since this application is in condition for allowance except for formal matters, prosecution as to the merits is
closed in accordance with the practice under Ex parte Quayle, 1935 C.D. 11, 453 O.G. 213.

Disposition of Claims 

4) [X] Claim(s) 1-4, 6-11 and 13-27 is/are pending in the application.

4a) Of the above claim(s) is/are withdrawn from consideration.

5) [ ] Claim(s) is/are allowed.

6) [X] Claim(s) 1-4, 6-11 and 13-27 is/are rejected.

7) [ ] Claim(s) is/are objected to.

8) [ ] Claim(s) are subject to restriction and/or election requirement.

Application Papers 

9) [ ] The specification is objected to by the Examiner.

10) [X] The drawing(s) filed on 19 July 2001 is/are: a) [X] accepted or b) [ ] objected to by the Examiner.

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a).

Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d).

11) [ ] The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-152.

Priority under 35 U.S.C. § 119 

12) [ ] Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 119(a)-(d) or (f).
a) [ ] All b) [ ] Some * c) [ ] None of:

1. [X] Certified copies of the priority documents have been received.

2. [ ] Certified copies of the priority documents have been received in Application No. .

3. [ ] Copies of the certified copies of the priority documents have been received in this National Stage
application from the International Bureau (PCT Rule 17.2(a)).
* See the attached detailed Office action for a list of the certified copies not received. 

Final Action
Response to Amendments/Arguments 

1. Applicant's arguments, see pages 6-7, filed August 20, 2004, with respect to
the rejection(s) of claims 1-4, 6-11 and 13-27 under Edwards et al (US 6,545,706 B1);
Abdel-Mottaleb et al (US 6,263,113 B1); Ippolito et al (US 6,072,522); and Uehara et al
(US 4,961,177) have been fully considered and are not persuasive. Therefore, this
Office action is made final.

2. Applicant argues in essence that the prior art does not teach or suggest 
removing flesh color from captured video. Examiner disagrees and indicates that Abdel- 
Mottaleb teaches removing flesh color from captured video (column 3, lines 24-27; and 
60-66). 

3. Applicant argues in essence that the prior art does not teach or suggest 
determining a user's position from captured image of the speaker. Examiner disagrees 
and indicates that Uehara teaches determining a user's position from captured image of 
the speaker (column 1, lines 64-67; column 2, lines 1-4; column 4, lines 12-25 and 35- 
46). 

4. Applicant argues in essence that the prior art does not teach or suggest using 
the image captured by the video capture device to adjust the data from each 
microphone. Examiner disagrees and indicates that Uehara teaches using the image 
captured by the video capture device to adjust the data from each microphone (column 
3, lines 28-32). 



Application/Control Number: 09/750,524 
Art Unit: 2625 




Claim Rejections - 35 USC § 103 

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

6. Claims 1-4, 6-11 and 13-17 are rejected under 35 U.S.C. 103(a) as being
unpatentable over Edwards et al (US 6,545,706 B1) in view of Abdel-Mottaleb et al
(US 6,263,113 B1).

Regarding claim 1, Edwards discloses a system, method and article of
manufacture for tracking a head of a camera-generated image of a person, comprising:

detecting a color characteristic (column 8, lines 24-27); 

detecting motion (column 7, lines 43-47). 
However, Edwards is silent about specific details regarding the step of: 

removing flesh color from captured video. 
In the same field of endeavor (face detection), however, Abdel-Mottaleb discloses
a method for detecting a face in a digital image comprising removing the flesh color from
the captured video (see abstract and column 3, lines 24-27 and 60-66).
It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to use the step of removing the flesh color from the captured video 
as taught by Abdel-Mottaleb in the system of Edwards because Abdel-Mottaleb 
provides Edwards a face detection system which is useful for security systems, criminal 
identifications, digital image capturing and teleconferences. In a security system, for
example, it is useful to detect the facial portions of an image being viewed so that an
operator of the system can discern whether a human is present in the image. This
method also provides a digital image composed of a plurality of pixels and produces a
binary image from the digital image by detecting skin-colored pixels. The method
includes removing pixels corresponding to edges in the luminance component, thereby
producing binary image components.
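
For illustration only, the skin-colored-pixel detection and luminance-edge removal summarized above can be sketched as follows. This is a minimal sketch and not the method of Abdel-Mottaleb: the color conversion is the standard BT.601 full-range RGB-to-YCbCr form, while the chrominance ranges, the gradient threshold, and the function names are assumptions chosen for the example.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert 8-bit RGB to YCbCr (BT.601, full range)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def skin_mask(image, cb_range=(77, 127), cr_range=(133, 173)):
    """Return a binary image: 1 where a pixel's chrominance looks skin-colored.

    The default ranges are assumed illustrative values, not taken from the
    cited references.
    """
    mask = []
    for row in image:
        mask_row = []
        for r, g, b in row:
            _, cb, cr = rgb_to_ycbcr(r, g, b)
            is_skin = (cb_range[0] <= cb <= cb_range[1]
                       and cr_range[0] <= cr <= cr_range[1])
            mask_row.append(1 if is_skin else 0)
        mask.append(mask_row)
    return mask


def remove_luma_edges(image, mask, edge_threshold=40.0):
    """Clear mask pixels that lie on strong edges of the luminance component.

    The edge measure is a simple forward-difference gradient; the threshold
    is an assumption for the example.
    """
    height, width = len(image), len(image[0])
    luma = [[rgb_to_ycbcr(*px)[0] for px in row] for row in image]
    out = [row[:] for row in mask]
    for y in range(height):
        for x in range(width):
            right = luma[y][min(x + 1, width - 1)]
            down = luma[min(y + 1, height - 1)][x]
            gradient = abs(right - luma[y][x]) + abs(down - luma[y][x])
            if gradient > edge_threshold:
                out[y][x] = 0
    return out
```

The image is represented as a list of rows of (R, G, B) tuples; the connected regions that remain in the returned mask would correspond to the binary image components mentioned above.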

Regarding claim 2, Edwards discloses a system, method and article of
manufacture for tracking a head of a camera-generated image of a person, including controlling
a processor-based system based on the detection of flesh color and the detection of a 
shape associated with a human being (column 4, lines 51-55 and column 8, lines 46- 
53). 

Regarding claim 3, Edwards discloses a system, method and article of
manufacture for tracking a head of a camera-generated image of a person, including
determining whether to process image data depending on whether both motion and 
flesh color are detected (column 7, lines 43-47 and column 8, lines 46-56). 

Regarding claim 4, Edwards discloses a system, method and article of
manufacture for tracking a head of a camera-generated image of a person, including capturing
a frame of video at a time, and determining after capturing each frame whether or not 
flesh color has been detected (column 6, lines 37-41 and column 10, lines 11-18). 

Regarding claim 6, Edwards discloses a system, method and article of
manufacture for tracking a head of a camera-generated image of a person, including moving an
animation object while capturing video and removing the detected flesh color from the
captured video (column 5, line 41).

However, Edwards is silent about specific details regarding the step of: 

removing the flesh color from the captured video. 
In the same field of endeavor (face detection), however, Abdel-Mottaleb discloses
a method for detecting a face in a digital image comprising removing the flesh color from
the captured video (see abstract and column 3, lines 62-66).
It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to use the step of removing the flesh color from the captured video 
as taught by Abdel-Mottaleb in the system of Edwards because Abdel-Mottaleb 
provides Edwards a face detection system which is useful for security systems, criminal 
identifications, digital image capturing and teleconferences. In a security system, for
example, it is useful to detect the facial portions of an image being viewed so that an
operator of the system can discern whether a human is present in the image. This
method also provides a digital image composed of a plurality of pixels and produces a
binary image from the digital image by detecting skin-colored pixels. The method
includes removing pixels corresponding to edges in the luminance component, thereby
producing binary image components.

Regarding claim 7, Edwards discloses a system, method and article of
manufacture for tracking a head of a camera-generated image of a person, including capturing
video of an animation object in a plurality of different positions and automatically
removing an image of a user's hand from the captured video (column 2, lines 12-18 and
41-52).

Claim 8 is analyzed similarly to claim 1 above.

Claim 9 is analyzed similarly to claim 2 above.

Claim 10 is analyzed similarly to claim 3 above.

Claim 11 is analyzed similarly to claim 4 above.

Claim 13 is analyzed similarly to claim 7 above.

Regarding claim 14, Edwards discloses a system, method and article of
manufacture for tracking a head of a camera-generated image of a person, comprising:

a processor (fig. 1, element 110 and column 4, lines 51-56); and

a storage (fig. 1, element 120 and column 4, lines 51-56) coupled to said
processor, storing instructions that enable the processor to detect motion (column 3,
lines 14-16) and a skin color characteristic (column 8, lines 24-27) and remove the skin
color characteristic from captured video.

However, Edwards is silent about specific details regarding the step of: 

remove the color characteristic from the captured video. 
In the same field of endeavor (face detection), however, Abdel-Mottaleb discloses
a method for detecting a face in a digital image comprising removing the color characteristic
from the captured video (see abstract and column 3, lines 62-66).
It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to use the step of removing the flesh color from the captured video 
as taught by Abdel-Mottaleb in the system of Edwards because Abdel-Mottaleb 
provides Edwards a face detection system which is useful for security systems, criminal
identifications, digital image capturing and teleconferences. In a security system, for
example, it is useful to detect the facial portions of an image being viewed so that an
operator of the system can discern whether a human is present in the image. This
method also provides a digital image composed of a plurality of pixels and produces a
binary image from the digital image by detecting skin-colored pixels. The method
includes removing pixels corresponding to edges in the luminance component, thereby
producing binary image components.

Claim 15 is analyzed similarly to claim 2 above.

Claim 16 is analyzed similarly to claim 3 above.

Regarding claim 17, Edwards is silent about specific details regarding the step of:

a digital imaging device coupled to said processor (column 3, lines 62-67 and 
column 5, lines 25-44). 

In the same field of endeavor (face detection), however, Abdel-Mottaleb discloses
a method for detecting a face in a digital image comprising removing the color characteristic
from the captured video (see column 6, lines 20-27).

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to use the step of a digital imaging device as taught by Abdel- 
Mottaleb in the system of Edwards because Abdel-Mottaleb provides Edwards a face 
detection system which is useful for security systems, criminal identifications, digital 
image capturing and teleconferences. In a security system, for example it is useful to 
detect the facial portions of an image being viewed so that an operator of the system
can discern whether a human is present in the image. This method also provides a
digital image composed of a plurality of pixels and produces a binary image from the
digital image by detecting skin-colored pixels. The method includes removing pixels
corresponding to edges in the luminance component, thereby producing binary image
components.

7. Claims 18 and 20-27 are rejected under 35 U.S.C. 103(a) as being unpatentable over
Ippolito et al (US 6,072,522) in view of Uehara et al (US 4,961,177).

Regarding claim 18, Ippolito discloses a video conferencing system for group video
conferencing comprising:

capturing a video image of a speaker (column 13, lines 46-52); 

receiving audio information from the speaker through at least one
microphone (column 5, lines 30-40).

However, Ippolito is silent about specific details regarding the steps of: 

determining the user's position using said captured video image of said speaker; 

and, 

based on the user's position, adjusting a characteristic of the microphone. 
In the same field of endeavor, however, Uehara discloses a method and system for
inputting a voice through a microphone comprising the steps of:

determining the user's position using said captured video image of said speaker
(column 1, lines 64-67; column 2, lines 1-4; column 4, lines 3-25 and 35-46); and,

based on the user's position, adjusting a characteristic of the microphone
(column 3, lines 28-32).

It would have been obvious to a person of ordinary skill in the art at the time the
invention was made to use the steps of determining the user's position and adjusting a
characteristic of the microphone as taught by Uehara in the system of Ippolito because
Uehara provides Ippolito a system which can collect voice data from a person at a high
S/N ratio without impairing the usefulness and operability of the voice input system.
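
For illustration only, one way of adjusting a microphone characteristic based on the user's position can be sketched as follows. This is a minimal sketch under stated assumptions and is not the method of Uehara, Ippolito, or the application: it assumes the user's position has already been estimated from the captured image, that the microphone coordinates are known, and that gain is scaled in proportion to distance to offset the free-field 1/r falloff in sound pressure. The function name and the reference distance are hypothetical.

```python
import math


def mic_gains(user_pos, mic_positions, reference_distance=1.0):
    """Return one linear gain factor per microphone.

    Each gain grows in proportion to the distance between the user and the
    microphone, so a microphone twice as far away is amplified twice as
    much (an assumed compensation rule for this example).
    """
    gains = []
    for mic in mic_positions:
        distance = math.dist(user_pos, mic)
        # Guard against a zero distance to avoid a zero gain.
        gains.append(max(distance, 1e-6) / reference_distance)
    return gains
```

With the user at the origin and microphones one and two units away, the second microphone receives twice the gain of the first.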

Regarding claim 20, Ippolito is silent about the method including tracking
the user's facial position in two dimensions and estimating the user's facial position in a
third dimension.

In the same field of endeavor, however, Uehara discloses a method and system for 
inputting a voice through a microphone comprising the steps of: 
tracking the user's facial position in two dimensions and estimating the user's facial 
position in a third dimension (column 4, lines 12-22). 

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to use the step of tracking the user's facial position in three 
dimensions as taught by Uehara in the system of Ippolito because Uehara provides 
Ippolito a system which can collect voice data from a person at a high S/N ratio without 
impairing the usefulness and operability of the voice input system. 

Regarding claim 21, Ippolito is silent about the method including tracking
the user's facial position in three dimensions.

In the same field of endeavor, however, Uehara discloses a method and system for
inputting a voice through a microphone comprising the step of:
tracking the user's facial position in three dimensions (column 4, lines 12-22).
It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to use the step of tracking the user's facial position in three 
dimensions as taught by Uehara in the system of Ippolito because Uehara provides 
Ippolito a system which can collect voice data from a person at a high S/N ratio without 
impairing the usefulness and operability of the voice input system. 

Regarding claim 22, Ippolito is silent about the method including using a
point of source filter to adjust the audio information received from the user and providing
said adjusted audio information to a speech recognition engine.

In the same field of endeavor, however, Uehara discloses a method and system for
inputting a voice through a microphone comprising the step of:
adjusting audio information (column 4, lines 23-35 and column 5, lines 5-15).

It would have been obvious to a person of ordinary skill in the art at the time the
invention was made to use the step of adjusting audio information as taught by Uehara
in the system of Ippolito because Uehara provides Ippolito a system which can collect
voice data from a person at a high S/N ratio without impairing the usefulness and
operability of the voice input system.

Claim 23 is analyzed similarly to claim 18 above.

Regarding claim 24, Ippolito discloses the system including a pair of
video cameras for capturing an image of said user (fig. 11, element 130 and column 4,
lines 9-24; column 15, lines 47-52).

Claim 25 is analyzed similarly to claim 20 above.

Claim 26 is analyzed similarly to claim 21 above.

Claim 27 is analyzed similarly to claim 22 above.

8. Claim 19 is rejected under 35 U.S.C. 103(a) as being unpatentable over Ippolito
et al (US 6,072,522) and Uehara et al (US 4,961,177) as applied to claims 18 and 23
above, and further in view of Salisbury (US 5,917,775).

Regarding claim 19, Ippolito and Uehara are silent about the method
including receiving audio information from a pair of microphones and adjusting the
sensitivity of the microphones based on the relative positioning of the user with respect
to each microphone.

In the same field of endeavor, however, Salisbury discloses adjusting the sensitivity of
the microphone based on the relative positioning of the user with respect to each
microphone (column 4, lines 33-39).

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to use adjusting the sensitivity of the microphone based on the 
relative positioning of the user with respect to each microphone as taught by Salisbury 
in the system of Ippolito because Salisbury provides Ippolito a system with a
microphone for detecting pre-set sensitivity levels, the system being capable of
detecting the discharge of a firearm and transmitting an alerting signal to a
predetermined location.

Claim Rejections - 35 USC § 102 

9. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

10. Claims 18, 20-23 and 25-27 are rejected under 35 U.S.C. 102(b) as being
anticipated by Uehara et al (US 4,961,177).

Regarding claim 18, Uehara discloses a system for inputting a voice through a 
microphone comprising: 

capturing a video image of a speaker (fig. 2, element 22 and column 2, lines 60-62);

receiving audio information from the speaker through at least one
microphone (fig. 2, element 12 and column 2, lines 60-62);

determining the user's position using said captured video image of said speaker
(column 2, lines 62-65; fig. 3); and,

based on the user's position, adjusting a characteristic of the microphone
(column 3, lines 1-11 and 28-32).

Regarding claim 20, Uehara discloses a system for inputting a voice through a
microphone including tracking the user's facial position in two dimensions and
estimating the user's facial position in a third dimension (column 4, lines 12-22).

Regarding claim 21, Uehara discloses a system for inputting a voice through a 
microphone including tracking the user's facial position in three dimensions (column 4, 
lines 12-22). 

Regarding claim 22, Uehara discloses a system for inputting a voice through a 
microphone including using a point of source filter to adjust the audio information 
received from the user and providing said adjusted audio information to a speech 
recognition engine (column 4, lines 23-35 and column 5, lines 5-15). 

Claim 23 is analyzed similarly to claim 18 above.

Claim 25 is analyzed similarly to claim 20 above.

Claim 26 is analyzed similarly to claim 21 above.

Claim 27 is analyzed similarly to claim 22 above.

Other prior art cited 

11. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Girod (6,483,532 B1) discloses a video-assisted audio signal processing system
and method.

Correa (6,024,337) discloses a computer monitor utility assembly.

Drumm (5,426,450) discloses a hands-free hardware keyboard.

Conclusion

12. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time
policy as set forth in 37 CFR 1.136(a).

A shortened statutory period for reply to this final action is set to expire THREE
MONTHS from the mailing date of this action. In the event a first reply is filed within
TWO MONTHS of the mailing date of this final action and the advisory action is not
mailed until after the end of the THREE-MONTH shortened statutory period, then the
shortened statutory period will expire on the date the advisory action is mailed, and any
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of
the advisory action. In no event, however, will the statutory period for reply expire later
than SIX MONTHS from the mailing date of this final action.
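
For illustration only, the three-month and six-month dates described above can be computed from a mailing date as sketched below. This is a minimal sketch, not a statement of Office practice: the function names are hypothetical, the end-of-month clamping is an assumption of the example, and the sketch ignores weekends, federal holidays, and the advisory-action rule described above. The mailing date must be taken from the cover sheet.

```python
import calendar
from datetime import date


def add_months(start, months):
    """Return the date `months` calendar months after `start`.

    If the target month is shorter than the start day, the result is
    clamped to the last day of the target month.
    """
    month_index = start.month - 1 + months
    year = start.year + month_index // 12
    month = month_index % 12 + 1
    day = min(start.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


def reply_deadlines(mailing_date):
    """Return the shortened (3-month) and maximum (6-month) reply dates."""
    return {
        "shortened": add_months(mailing_date, 3),
        "maximum": add_months(mailing_date, 6),
    }
```

For a hypothetical mailing date of January 31, 2005, the sketch returns April 30, 2005 and July 31, 2005.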

Contact Information 

13. Any inquiry concerning this communication or earlier communications from the 
Examiner should be directed to ABOLFAZL TABATABAI whose telephone number is 
(703) 306-5917. 

The Examiner can normally be reached on Monday through Friday from 9:30 a.m. to 
7:30 p.m. If attempts to reach the examiner by telephone are unsuccessful, the 
Examiner's supervisor, Mehta Bhavesh M, can be reached at (703) 308-5246. The fax 
phone number for the organization where this application or proceeding is assigned is (703)
872-9306.

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for published 
applications may be obtained from either Private PAIR or Public PAIR. Status 
information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov . Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
Abolfazl Tabatabai 

Patent Examiner
Group Art Unit 2625 

January 15, 2005

KANJIBHAI PATEL
PRIMARY EXAMINER




